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UMIT I§LAK 

Abstract. This paper studies statistics of riffle shuffles by relating them to 
random word statistics with the use of inverse shuffles. Asymptotic normality 
of the number of descents and inversions in riffle shuffles with convergence 
rates of order in the Kolmogorov distance are proven. Results are also 

given about the lengths of the longest alternating subsequences of random 
permutations resulting from riffle shuffles. A sketch of how the theory of 
multisets can be useful for statistics of a variation of top m to random shuffles 
is presented. 

1. Introduction 

For a sequence x = (xi, x n ) of real numbers, the number of descents and 
inversions are defined as des(x) = J27=i ^-( x i > x i+i) an d wu(x) = J2i<j ^-( x i > 
Xj), respectively. For a permutation tt in the symmetric group S n , we write des^Tt) 
for the number of descents in the sequence (7r(l), 7r(2), 7r(n)). Similar notation 
will be used for inversions and other permutation statistics. In this paper, we 
will analyze des(p) and inv(p) when p is a random permutation with riffle shuffle 
distribution (which is defined in the next section precisely), and will discuss some 
other related problems. 

Our interest in descent-inversion statistics in riffle shuffles started with the fol- 
lowing elementary observation for uniformly random permutations: Let tt be a 
uniformly random permutation in S n and X = (X\, ...,X n ) be a random vector 
where AVs are independent and identically distributed (i.i.d.) (7(0, 1) random vari- 
ables. For i = f , n, let Ri and R[ be the ranks of n(i) and Xi in (tt(1), 7r(n)) 
and (Xi, X n ) respectively. Then (R±, R n ) =d (R[, R' n ) where =d denotes 
equality in distribution. 

This simple result, which can be proven by a simple induction (or, by a measure 
theoretic argument as in makes it easier to study problems regarding uniform 
permutation statistics by transforming them into independent t/(0, 1) random vari- 
able statistics. As an example, we have inv(w) =<j Yli<j IpQ > Xj) gi vm g an 
alternative representation of inv(n) that can be quite useful for asymptotic prob- 
lems. 

A natural question at this point is: What would J2i<j lO^i > Xj) represent if 
X-s were instead i.i.d. over [a] := {1, a} with distribution p = (pi, ...,p a ) where 
a > 2? Recently, Bliem and Kousidis [3J and Janson [T^] considered this problem in 
terms of the generalized Galois numbers and provided several different probabilistic 
explanations. 

In this paper, we give a different interpretation of this using random permu- 
tations which is analogous to the discussion given above for uniformly random 
permutations. This time, the equivalent distribution turns out to be a biased riffle 
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shuffle with a hands. Using this transformation, we are able to obtain asymptotic 
normality of the number of inversions in riffle shuffles (which was questioned in [8] , 
pg 10) with convergence rates, and also understand some other related statistics. 

The organization of this paper is as follows. Section 2 provides background in 
riffle shuffles and makes the connection to random words using inverse shuffles. It 
also discusses how similar results can be obtained for a variation of top m to random 
shuffles. Section 3 treats the asymptotic distribution of the number of descents and 
inversions in riffle shuffles. Section 4 provides asymptotic results for the lengths 
of longest alternating subsequences in uniformly random permutations and riffle 
shuffles. 

2. Riffle shuffles and connection to random words 

The method most often used to shuffle a deck of cards is the following: first, cut 
the deck into two piles and then riffle the piles together, that is, drop the cards 
from the bottom of each pile to form a new pile. The first mathematical models for 
riffle shuffles were introduced |9j and {16]. These were further developed in (2] and 
(8|. Now following (8], we will give two equivalent descriptions of riffle shuffles in 
the most general sense. For other alternative descriptions (which will not be used 
in this paper), see pQ and [5]. 

Description 1 : Cut the n card deck into a piles by picking pile sizes according 
to the mult(a; p) distribution, where p = (pi, ...,p a ). That is, choose b\, b a with 
probability 

Then choose uniformly one of the ( bl n b ) ways of interleaving the packets, leaving 
the cards in each pile in their original order. 

Definition 2.1. The probability distribution on S n resulting from Description 
1 will be called as the riffle shuffle distribution and will be denoted by P n ,a,v 
When p = (1/a, 1/a, 1/a), the shuffle is said to be unbiased and the resulting 
probability measure is denoted by P n . a - Otherwise, shuffle is said to be biased. 

Note that the usual way of shuffling n cards corresponds to P n ,2 (assuming that 
the shuffler is not cheating). Before moving on to Description 2, let's give an 
example using unbiased 2-shuffles. The permutation 

/1234567\ 
pna "^1253674^ 

is a possible outcome of the P n ,2 distribution. Here the first four cards form the 
first pile, the last three form the second one and these two piles are riffled together. 

The following alternative description will be important in the following discus- 
sion. 

Description 2 : (Inverse a-shuffles) The inverse of a biased a-shuffle has the 
following description. Assign independent random digits from {l,...,a} to each 
card with distribution p = (j>±, ...,p a ). Then sort according to digit, preserving 
relative order for cards with the same digit. 
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In other words, if a is generated according to Description 2, then a <~ P n ,a,p- A 
proof of the equivalence of these two descriptions (with two other formulations) for 
unbiased shuffles can be found in [2]. Extension to biased case is straightforward. 
Now let's give an example of generating a random permutation with distribution 
Pn,2 using inverse shuffles. 

Consider a deck of 7 cards. We wish to shuffle this deck with the unbiased 2- 
shufflc distribution using inverse shuffles. Let X = (X\, X n ) = (1, 1, 2, 1, 2, 2, 1) 
be a sample from f/({l,2} 7 ). Then, sorting according to digits preserving rela- 
tive order for cards with the same digit gives the new configuration of cards as 
(1,2,4,7,3,5,6). In the usual permutation notation, the resulting permutation 
after the inverse shuffle is 

'1 2 3 4 5 6 7 
1 2 4 7 3 5 6 



and the resulting sample from P n ,i is 

Pn,2 ■= O-" 1 = 



1 2 3 4 5 6 7 
1 2 5 3 6 7 4 

In the following, we will sometimes call the random vector X = (Xi, X n ) 
where X^s are independent with distribution p = (pi, ...,p a ) as a random word. 

Next we formalize the relation between riffle shuffles and random words. Let 
Pn,a,p be a random permutation with distribution P n , a ,p that is generated using 
inverse shuffles with the random word X = (Xi, X n ) and observe that 

A»,a, P (i) = #{j : -V, < Xi} + #{j • /:.Y, .V,|, 

Thus for i, k € [n], we have p n<at p(i) > p n ,a,p(k) if and only if 

•••• {./ : -V, < A M • ,•{./ : j < i, Xj = \,\ > #{j : X } < X,\ ■ |./ : j < k, X, = X k }. 

Using this, for the case i < k, we immediately arrive at the following important 
lemma. 

Lemma 2.2. Let X = (X\, ...,X n ) where Xi's are independent with distribution 
p = (pi, ...,p a ). Also let p„ t a,p be the corresponding permutation as described above 
so that p nt a,p has distribution P n . a . p - Then for i < k, p n>a>p (i) > p n ,ap(k) if and 
only if Xi > X k . 

This has the following corollary: 

Corollary 2.3. Consider the setting in Lemma Wlh and let S C {(i,j) € [n] x [n] : 
i < j}. Then 

HPn.cpii) > Pn,a,p{j)) = 1 ^ > 

(hj)es (ij)es 

In the following two sections, we will make use of this connection to study various 
statistics of riffle shuffles. Before that, we demonstrate the use of random words 
approach with two other examples. The first one will be relating riffle shuffles to 
uniformly random permutations and the second one will give a different interpreta- 
tion of a variation of top to random shuffles. As a general remark, we note that the 
results in this paper will be mostly given for unbiased shuffles to keep the notations 
simple. However, all the results in this paper are extendible to the biased case in a 
straightforward way. 
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We start with a total variation result relating riffle shuffle statistics and uniform 
permutation statistics. Although the result is given for des and inv, it is much 
more general as can be seen from the proof easily. 

Theorem 2.4. Let p n ^ a and n be random permutations in S n with unbiased a—shuffle 
distribution and uniform distribution, respectively. If f = des or f — inv, then for 
any a > n, 

d T v{f{Pn,a)J(-K))<l-J-^-T-^. 

[a — ny. ar 

In particular, c?Tv(/(Pn,a), /M) as a — > oo. 

Proof. Let 7r be a uniformly random permutation and p n _ a be a random permutation 
with distribution P n . a that is generated using inverse shuffling with the random 
vector X = (X\, ...,X n ). Also let T be the number of different digits in the vector 
X. Then we have 

nf(Pn.a)eA) = V(f(p n , a )eA,T = n)+V(f( Pn , a )eA,T<n) 

= nf(Pn.a) G A\T = n)P(T = n) + P(/(p„,a) eA,T<n) 
< ¥(f(n)eA)+P(f(p n , a )eA,T<n) (2.1) 

where (I2.ip follows by observing P(/(p n ,a) G A\T = n) = V(tt € A) since p n , a has 
uniform distribution conditional on T — n. This yields 

P(/(p«,a) & A) - P(/(vr) € A) < P(/(p n , Q ) ei,T<«)<P(T<«). (2.2) 

Similarly, we have 

P(/(tt) G A) = P(/(tt) G A)P(T = n) + P(/(tt) G A)P(T < n) 

< P(p„, n G A)+P(T<n) 

implying 

P(/(tt) P(/(pn,a) G A) < P(T > n). (2.3) 

Hence combining (|2.2[) and (|2.3[) . for a > n, we get 

d T v(f(Pn,a), /W) < P(T < ») = P [ (J{A, = X,} j = 1 - P j p|{JQ ^ Xj-} 



( a )> 

n 

a! 1 



a 

1 



(a — n)! a n 

proving the first claim. The second assertion is immediate from the bound we 
obtained. □ 

Remark 2.5. As can be seen easily from the proof, the result is actually true for a 
large class of functions /. 



Remark 2.6. Note that Theorem l2.4l is also informative for understanding multiple 
2-shuffles by a nice convolution property of riffle shuffles given by Fulman [8 J . Let- 
ting p = (pi, ...,p a ), p' = (p[, ■■■,p' b ) be two probability measures and defining the 
product ® by p ® p' = (pip'i, ...,pip' b , ...,p a p[, ...,p a p b ), Fulman's result gives that 
the convolution of P n ,a,p and P n ,a,p' is Pn,ab,p®p' ■ In particular, when a = b = 2 
and pi = P2 = 1/2, the case of multiple 2-shuffles is handled. 
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Since convergence in total variation implies convergence in distribution, we also 
have 

Corollary 2.7. If the shuffle is unbiased, then f(p n ,a) — >d /(f) as a — > oo. 

We close this section by describing how one can use above ideas to study a 
variation of top m to random shuffles which was first introduced in [7^. Consider 
a deck of n cards and let < m < n be fixed. Now cut off the top to cards and 
insert them randomly among the remaining n — m cards, keeping both packets in 
the same relative order. We will call this shuffling method as ordered top to to 
random shuffles. 

An ordered top to to random shuffle is actually equivalent to a 2-shuffle in which 
exactly to cards are cut off (whereas for the 2-shuffles case, to is a binomial random 
variable). It is not hard to see that the following result gives an inverse description 
of ordered top to to random shuffles. 

Theorem 2.8. The inverse of an ordered top to to random shuffle has the fol- 
lowing description. Assign card i G [n] a random bit Xi where the random vec- 
tor X = (X\, ...,X n ) is uniformly distributed over {0, 1}™ with the restriction that 
y^-, Xi = n — to. Then sort according to digit, preserving relative order for cards 
with the same digit. 

Now letting r be a random permutation in S n with ordered top to to random 
shuffle distribution, Theorem 12.81 allows us to rewrite des(r) or inv(r) in a useful 
way exactly as we did in Corollary |2.3l Namely, we have 

n-l 

des(r) = d ^2l(X l > X l+1 ) and inv(r) = d ^t(X l > Xj) 

i=l i<j 

where X = (X\, ...,X n ) is uniformly distributed over {0,1}™ with the restriction 
that X)"=i Xi = n — to. Hence the problem is transformed into a problem of uniform 
permutations of a fixed multiset which is well studied in the literature. See, for 
example, [6] . We will revisit this at the end of Section [3] 

3. Convergence rates for the number of descents and inversions 

In this section we will discuss the asymptotic normality of the number of descents 
and inversions in riffle shuffles and will provide convergence rates of order l/\/n 
in the Kolmogorov distance. Recall that the Kolmogorov distance between two 
probability measures fi and v on R is defined to be 

d K (p,v) = sup|/i((-oo,z]) - v((-oo,z])\. 

We start with the asymptotic normality of the number of inversions after an a shuffle 
which was conjectured by Fulman in [8] for unbiased 2-shuffles. Our strategy will 
be using Corollary 12.31 to transform the problem into random words language, use 
Janson's U-statistic construction [T^] for the random words case and finally use 
Chen and Shao's results on asymptotics of U-statistics [5]- Before moving on to 
the main result, we provide some pointers to the literature and give the necessary 
background on U-statistics. 

First we note that the asymptotic normality of the number of inversions in ran- 
dom words is recently proven by Bliem and Kousidis [3] without convergence rates 
in a more general framework. In [12] . Janson gave equivalent descriptions of the 
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random words problem and analyzed the asymptotic behavior using U-statistics 
theory. Naturally, the convergence rate result given here will also apply to Janson's 
case. 

Now recall that for a real valued symmetric function h : W" —> R and for a 
random sample X±, ...,X n with n > to, a U-statistic with kernel h is defined as 

u n = u n (h) = 4y E Kx^-iXij 

Km) C m „ 

where the summation is over the set C mi „ of all ( n ) combinations of m integers, 
i\ < %2 < ■■■ < im chosen from {1, n}. The next result of Chen and Shao will be 
useful for obtaining convergence rates in the Kolmogorov distance. We note that 
throughout this paper, Z will denote a standard normal random variable. Also in 
the following statement hi(Xi) := E[h(Xi, X m )\Xi]. 

Theorem 3.1. [5] Let X\, ...,X n be i.i.d. random variables, U n be a U-statistic 
with symmetric kernel h, E[h{X\, ...,X m )] = 0,a 2 — Var{h{Xi, ...,X m )) < oo and 
a\ = Var(h 1 (X 1 )) > 0. // in addition E\h 1 (X 1 )\ :i < oo, then 



fV^ TT A . 6.1EM*i)l 3 , (l + -s/2)(m-l)ff 



\TO(Ti ' / y/naf (m(n — m + l)) 1 / 2 ^ 



d K 




Now we are ready to state and prove our main result on the number of inversions 
in riffle shuffles. 

Theorem 3.2. Let p n ^ a be a random permutation with distribution P n ^ a with a > 2. 
Then 

where C is a constant independent of n. 

Proof. Let a > 2 and p n ^ a have distribution P n . a . Using Corollarv l2.31 we have 
inv{p n , a ) := E l(pn,a(0 > PnAj)) =d E > X ^ 

i<j i<j 

where X^s are independent and uniformly distributed over [a]. This immediately 
yields 



\inv(p ni a)] = E 



E > x ^ 



n\ m , . n(n — 1) a — 1 



Using similar elementary computations one gets 



a = Var{inv(p nia )) 



n 



(n - l)(2n + 5) a 2 - I 



72 a 2 ■ 

See [3] or [TJ] for details. Now following [TJ], we will find a U-statistic representation 
of inv{p n ,a)- All details are included for the sake of completeness. 

Let U\,...,U n be independent random variables uniformly distributed over (0, 1). 
Order L/j's as U a ^ < U^m) < ■■■ < U a r n ) where a € S n is properly chosen. Since a 
has uniform distribution over S n , we have 

(Xi,...,X n ) —a (-X"ct(i), — , AT CT („)). (3.1) 
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Now we get 

inv(p n , a ) = d Y H x i > x j) =d Y > x a(j)) 

i<j i<j 
n 

= 1 ( X «r(i) >X aU) ,i<j). 

where the second equality follows from (|3.1[) . Observing i < j if and only if U a u\ < 
U a fj) , we obtain 

n n 

ira(p„,„) = d Y l ( X °(i) > ^CjJ^W < U <r(j)) = > ^i) 1 ^ < ^i)(3-2) 

Next let Zi = (Xj,f7,-) for i = l,...,n and observe that Z^s are i.i.d. random 
variables. Define the functions / and g by 

f((Xi,Ui),(Xj,Uj)) := \^jl(xi > Xj)l(ui <Uj) 

and 

g((xi,Ui), (xj,uj)) = f((xi,Ui), (xj,Uj)) + f((xj,Uj), (x l ,u i )). 
Then clearly g is a real valued symmetric function and 

n 1 
Y H*k > Xi)l(U k <U l ) = j-Y 9(Zk,Zi). (3.3) 

fc,i=l V2/ fe<; 

Thus, by (|3.2[) and (|3.3[) we conclude that inv(p n , a ) is a U-statistic with 
in«(pn,«) = d Q g Q (IpQ > Xj)l{Ui < Uj) + l(X t < X^I^ > U 3 )). 

So in terms of Theorem 13. 1[ we have h((xi, iti), (x 2) 112)) = (o)k((xi, ui), (x 2 , 112)) 
where 

k{{x\,u{), (x 2) u 2 )) = l(xi > x 2 )t(ui < u 2 ) + l(xi < x 2 )l(ui > u 2 ) - ^— — . 

la 

Defining 

k 1 (x 1 ,u 1 )=M[k{X 1 ,U 1 ),(X 2 ,U 2 )\X 1 =x 1 ,U 1 =u 1 ], 
we have hi{xi,u\) = (™)fci(a;i, Ui). Also 

kxiX^Ux) = E[1(X 1 >X 2 )1(U 1 <U 2 ) + 1(X 1 <X 2 )1(U 1 >U 2 )\X 1 ,U 1 ]-^- 

la 

a a la 

= -(X 1 -2X 1 U 1 + (a+l)U 1 -l)-^^. 
a la 

Now doing some elementary computations, we obtain 

\ 2 / \ 2 2 1 



and also 

3 



E|/ii(Xi,J7i)| d <9 



cS 
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Hence using Theorem 13. 1[ we arrive at 



'A' 



q 2 -l 
36a 2 



which in particular implies the existence of a constant C independent of n as in the 
statement of the theorem. This completes the proof. □ 

Remark 3.3. U-statistics construction given above will still work when the shuffle 
is biased. So under certain conditions on the distribution vector p = (pi,...,p a ) 
(namely, by excluding the case pj = 1 for some j € [a]), one can extend Theorem 
13.21 to the case of biased riffle shuffles (or random words) . 

Remark 3.4. By the nice convolution property discussed in Remark 12. 6[ Theorem 
13.21 also gives convergence rates for multiple unbiased 2— shuffles (with explicit con- 
stants, as can be seen easily from the proof). 

Next we move on to the number of descents in riffle shuffles which is much easier 
due to the underlying local dependence. Recall that, if we define the distance 
between two subsets of A and B of N by 

p(A,B) :=M{\i-j\:i€A,jeB}, 

the sequence of random variables Yi, Yi, ... is said to be m— dependent if {Yi, i G ^4} 
and {Yj, j £ B} are independent whenever p(A, B) > m with A, B C N. Now we 
recall the following result from [4] about m— dependent random variables. 

Theorem 3.5. [4] // {li}i>i is a sequence of zero mean m—dependent random 



varia 



tables, W = Yn=\ Y * and ^[W 2 ] = 1, then for all p € (2, 3], 



d K {W,Z) < 75(10m + l) p - 1 ^E|yi|f. 
Now, letting p na be a sample from P n , a , we know from Corollarv l2.3l that 

— l n — 1 

des(p rha ) = ^ HPn,a(i) > Pn,a( i + l )) =d X! M X i > 
t=l »=1 

where X^s are independent and uniform over [a]. Setting V = £™~i Y i with 
Y = l(Xi > X i+X ), we have E[V] = (n - l) 2 ^- Also since Var(Y t ) = 
and Cov(Y i7 Y i+ i) = — (^ a 12a ^ ^ or * = ^> •> n ~ •"•> we nave 

n—l 2i / 2 1 

Var(V) = J2v*r(Y i )+2j2Cov(Y i ,Y j ) = (n - 1)^_Z_ _ 2 (n - 1) I 

(q 2 -l)(n-f) 
12a 2 

Now noting that Y^s are f -dependent and using Theorem 13. 5 1 wit h p — 3, we arrive 
at 
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Theorem 3.6. Let p n ,a be distributed according to P n , a - Then 




where C is a constant independent of n. 

Remark 3.7. The discussion from Section [2] and a simple coupling argument gives 
the following stochastic dominance result, say, for the number of inversions: 



where a > 2, ir is a uniformly random permutation and < s denotes stochastic 
ordering. Since the means and variances of these three statistics are of the same 
order, it wouldn't be surprising to obtain the asymptotic normality of Inv(p n ^ a ) by 
the corresponding results for Inv{p n .^) and Inv(ir). We will pursue this idea in a 
future work. 

We conclude this section with a discussion of the asymptotic normality of the 
number of inversions after ordered top m to random shuffles which were defined at 
the end of Section [2j We start by recalling a special case of a result of Congar and 
Viswanath [6] on multisets. Let fj £ [1/2,1). Then there exists a constant C > 
depending only on /3 so that whenever r is a uniform permutation of the multiset 
{0™°, l™ 1 } with no, n\ 6 N, no + n\ = n, max{no, n\} < fin, 



is satisfied where p — E[c?es(r)] and a 2 = Var(des(r)) (For details, see [6]). It is 
easily seen from this result and Theorem 12.81 that, one can analyze the asymptotic 
behavior of the number of inversions in ordered top m to random shuffles under 
the assumption that max{m, n — m} < fin. Note that this also suggests a natural 
generalization of riffle shuffles. To see this, consider the case where the number of 
cards in the hands are (no,ni) where (no,n\) is uniform over the set {(no,ni) G 
[n] x [n] : no + ni = n, min{no, ni} > an} for some 1 > a > 0. When a = 0, we get 
Pn.2- Using a > 0, we get a different model which can be meaningful since when 
one shuffles a deck, there will be at least a few cards in each hand. 

4. Another related Statistic : Longest alternating subsequences 

In this section we will study the asymptotic behavior of lengths of longest al- 
ternating subsequences (which are closely related to descents) in uniform permu- 
tations and riffle shuffles. Letting x := (xi)" =1 be a sequence of real numbers, a 
subsequence x ikl where 1 < i\ < ... < i^ < n, is called an alternating subsequence 
if Xi t > Xi 2 < Xi 3 > ...Xi k . The length of the longest alternating subsequence of x 
is defined as 

LA n (x) := maxjfc : x has an alternating subsequence of length k}. 

For an example, let x = (3,1,7,4,2,6,5). Then (3,1,7,2,6,5) is an alternating 
subsequence and it is easy to see that LAi(x) — 6. For an excellent survey on 
longest alternating subsequence problem, see |18j . The following lemma, whose 
proof can be found in [11] and [17], is very useful to understand LA n {x) when x is 
a sequence of random variables. 



Inv(p n ,2) <s Inv(p n ,a) < s Inv(ir) 
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Lemma 4.1. |17| Let x := (xi)™ =1 be a sequence of distinct real numbers. Then 
LA n (x) = 1 + > X2) + # local extremum of x 

n—1 n—1 
= 1 + 1(X1 > X 2 ) + 2_j H x k-1 > Xk < Xk+l) + ^ H X k-l < x k> Xk+l). 
k=2 k=2 

Example 4.2. Let x = (3,1,7,4,2,6,5). Then the local maximums are {x3,xq} = 
{7,6} and the local minimums are {X2, x§} — {1,2}. Noting that xi > xi and 
using Lemma [4.1[ we get LA n (x) = 1 + 1 + 2 + 2 = 6. Indeed, the subsequence 
(3, 1, 7, 2, 6, 5) has length 6 and x does not have a longer alternating subsequence. 

Now we move on to discussing longest alternating subsequence of a uniformly 
random permutation n. In this direction, |11| and |17| find the expectation and 
variance as 

E[L^ n (7r)] = y + i and Var(LA n {7r)) = g - ^. 

They also prove asymptotic normality of LA n (j{) by using an alternative repre- 
sentation and the underlying local dependence. We contribute to their result by 
obtaining convergence rates in the Kolmogorov distance. 

Theorem 4.3. Let tt be a uniformly random permutation in S n . Then for every 
n> 1, 




where C is a constant independent of n. 

Proof. Let tt be a uniformly random permutation and Xi,...,X n be independent 
uniform random variables over (0, 1). Letting 

E k = {X k ^ >X k < X k+1 } U {X fc _x <X k > X k+1 } for k = 2, n - 1, (4.1) 

we have 



LA„(tt) = 1 + 1(tt(1) > tt(2)) + ^ l(7r(fc - 1) > vr(fc) < 7r(fc + 1)) 

k=2 

n-1 

+ ^ l(7r(fc - 1) < 7r(fc) > 7r(fc + 1)) 

k=2 

n—1 n—1 

= d 1 + 1{X X >X 2 ) + Y / l(Xk-i >X k < X k+1 ) + < X k > Xk+l) 

k=2 k=2 
n-1 

= l + l(X 1 >X 2 ) + J2 1 (Ek) (4.2) 



k=2 



where in the second equality, we used the discussion from the Introduction (or see 
[T7] for a precise statement). Now, clearly LA n (n) is a sum of 4-dependent random 
variables and result follows from Theorem 13.51 □ 
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Remark 4.4. Using the representation of LA n (ir) given in (|4.2p . one can easily 
obtain a concentration inequality for LA n (ir) by using, for example, McDiarmid's 
well known bounded differences inequality [13]. By (14.21) . we have 

n-l 

LA n (w) = d f{X x ,...,X n ) := 1 + l(Xi >X 2 ) + Y, l{E k ) 

k=2 

where X^s are independent random variables and E^s are defined in terms of JQ's 
as in (|4.ip . Now by a case analysis, it is easy to see that bounded differences 
property holds with Cfe = 3 for k = 1, n and one immediately arrives at 

P(|£A„(7r) - /i| > i) < 2e- 2t2 / 9 ™. 

Next we will work on the same problem for riffle shuffles. First note that, with 
its close connection to the number of extremum points and number of runs, longest 
alternating subsequences can be quite useful in non-parametric tests. Indeed, our 
motivation here comes from the practical discussions of this issue in [T3] on cheating 
in card games. 

We start by recalling the development of longest alternating subsequences in 
random words given in [TT] . This time we need to be careful about defining maxima 
and minima properly as we may have repeated values in the sequence. We say that 
a sequence x = (x\, ...,x n ) € [a]", has a local minimum at k, if (i) Xk < Xk+i 
or k = n, and if (ii) for some j < k, Xj > Xj+i = ... = x^-i = Xk- Similarly, 
x has a local maximum at fc, if Xk > Xk+i or fc = n, and if (ii) for some j < k, 
Xj < Xj+i = ... = Xk-i = Xk, or for all j < k, Xj — Xk- With these definitions, a 
useful representation of LA n (x) is found by Houdre and Restrepo [H] as 

LA n (x) = # of local maxima of x + # of local minima of x. 

Letting X = (X x , X n ) be a random word where X[s are independent and 
uniform over [a], they also show that 

LA„(X)-n(2/3-l/3a) 

T= >d & 

as n — > oo where 

2 8 ( {I + f/a)(f -3/4a)(l-l/2a) \ 
7 = 45 { l-2/(a + l) ) ■ (4 ' 3) 

(Note there is a typo in [TT] for the expression of 7 2 . This can be checked from 
|13| by taking limits in the corresponding variance formula). Now, Lemma l2.21 the 
discussion just before it with Houdre and Restrepo's result immediately gives 

Theorem 4.5. Let p n a be a random permutation with distribution P n ,a- Then 
LA n {p n . a ) - n(2/3 - l/3o) 

os n -> oo where 7 is as defined in |^T3|). 



>d N(0,1) 



This result can be generalized to biased shuffles as in previous problems in a 
straightforward way. Asymptotic mean and variance of this case are described 
in detail in [TT]. Also note that, due to the lack of local dependence, obtaining 
convergence rates is not as easy as the case of uniform random permutations for a 
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shuffles and it will be studied in a subsequent work. However, when one focuses on 
Pn, 2) one still has local dependence as we describe in the rest of this section. 

The ease of the 2-shuffle case comes from the following proposition which gives 
a characterization of extremum points of 2-shufHcs in terms of the descents. Note 
that this result also gives the asymptotic behavior of the number of local maxima 
or minima with the use of Theorem 13.61 

Proposition 4.6. Let p n ,2,p be a random permutation with distribution P n ,2,p gen- 
erated by inverse shuffling with the random vector X = (X\, ...,X n ) where Xi 's are 
independent with distribution p = (pi,f>2) with <P\ < 1. Then for k — 2, n— 1, 

i- Pn,2,p has a local maximum at k if and only if p n ,2,p has a descent at k. 

ii- Pn,2,p has a local minimum at k if and only if p n ,2,p has a descent at k — 1. 

Proof. i. (=>) Obvious. (<=) Assume ir(k) > 7r(fc + 1). We should show 

7r(fc — 1) < 7r(fc). Since 7r(fc) > 7r(fc + 1), we see that k th card comes from 
the second pile and k + 1 st from the first pile. Now whether card k — 1 
comes from the first pile or the second pile, we have ir(k — 1) < 7r(fc) since 
the relative orders of the piles are preserved, 
ii. Proof is similar to the maximum case and we skip it. 

□ 

Now we are ready to give a useful representation of LA n (p rit 2,p)- First recall that 

n-1 

LA n (n) = 1 + l(p„,2,p(l) > Pn,2,p(2)) + !(Pn,2,p(fc - 1) > Pn,2,p(k) < Pn,2,p(fc + 1)) 

k=2 

n-1 

+ ^ 1 (/°n,2,p(fc - 1) < Pn,2,p{k) > Pn,2,p( k + !))■ 
k=2 

Using Proposition ^. 61 we obtain 

n-1 n-2 

LA n (p n>2tP ) = d 1 + 1(X 1 >x 2 ) + Y / MX, > x l+1 ) + M x i > x i+i) 

i=2 i=l 

where X^s are independent with distribution p = (pi,p2). This immediately gives 

LA n {p n , 2 , P ) =d 2 (j^ MX, > X l+1 )^J + l{X n -i < X n ). (4.4) 

By the representation in (|4.4|) . it is clear that we still have local dependence 
for LA n (p n ,2,p) and thus, we can still use Theorem 13.51 with p — 3 to obtain a 
convergence rate of order 1/y/n for LA n (p nt 2, P )- 

5. Concluding Remarks 

In this note, after relating riffle shuffle statistics to random word statistics, we 
were able to obtain asymptotic normality results with convergence rates for the 
number of descents and inversions after an arbitrary number of a-shuffles. Through- 
out the way, we also discussed how similar ideas can be used for a variant of top 
m to random shuffles and provided small contributions to Houdre and Restrepo's 
work on longest alternating subsequences. 

In subsequent work, we will provide convergence rates for the length of longest 
alternating subsequences in a-shuffles for a > 2. We also hope to find out a general 
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framework for establishing the asymptotic normality of a large class of a— shuffle 
statistics. One possible direction for this can be using the stochastic dominance 
idea introduced in Remark 13 .71 as in many cases it can be easier to prove the results 
for 2-shuffles and uniformly random permutations. 
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